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(57) Abstract 




A polypeptide having immunological activity for use as a diagnostic reagent and/or a vaccine component for the HIV vi- 
rus. The polypeptide comprises a substantial portion of each of more than one of the constituent proteins coded for by the HIV- 
pol gene, namely HIV-pol protease, HIV-pol reverse transcriptase, HIV-pol RNase H and HIV-pol integrase. 
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POLYPEPTIDE HAVING T MMUNOT.OGTCAL ACTIVITY FOP TTg- 
DIAGNOST IC REAGFNT AND/OR VACCTNF. 

TECHNICAL FTET.D 

This invention relates to a polypeptide having 
5 immunological activity for use as a diagnostic reagent 
and/ or a vaccine component. 
BACKGROUND &RT 

Diagnostic kits for use in screening individuals for 

infection with human immunodeficiency virus (HIV) infection 

10 frequently include reagents comprising HIV antigens which 

are used to detect antibodies using known immunological 

techniques including ELISA, Western Blot, latex 

agglutination and immuno-luminescent and immuno-fluorescent 
techniques . 

15 The effectiveness of such techniques however depends 

upon selection of suitable immunological reagents and one 
particular difficulty which arises is that particular 
reagents are often specific to individual strains or groups 
of strains of HIV. Thus, for example, known diagnostic 

20 reagents based upon HIV-1 may fail to detect antibodies 
resulting from an infection of a patient with HIV-2. 

Similarly, in the production of vaccines designed to 
protect individuals against HIV infection, the use of 
antigens derived from one particular strain of HIV may fail 

25 to provide adequate protection against infection with other 
strains. 

It is an object of the present invention to overcome 

such problems. 

DISCLOSURE OF INVENTION 
30 It has now been found that the product of expressing a 

substantial part of the HIV-pol gene in a suitable host has 

antigenic properties which allows the above-mentioned 

problems to be overcome. 

Thus according to one aspect of the present invention 
35 there is provided the use as an antigenic reagent in the 

diagnostic test or as a vaccine component of a polypeptide 
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comprising a substantial portion of each of more than one of 
the constituent proteins coded for by the HIV-pol gene. 

Diagnostic kits and vaccines comprising said polypeptide 
form further aspects of the present invention. 
5 The HIV-pol gene codes for four enzymes, namely a 

protease, a reverse transcriptase, a ribonuclease referred 
to as RNAse H and an enzyme referred to as Integrase. 

It is believed that during infection of a T cell by HIV 
a full length precursor is expressed which is then cut up 
10 into the discrete proteins listed above. These have the 
following activities and (it is thought) act in the order 
indicated: - 

Protease Precursor Cleavage 

Reverse Transcriptase Preparation of viral DNA from viral 
15 RNA 

RNAse H Destruction of viral RNA leaving 

newly synthesised DNA 

Integrase Insertion of said DNA into host cell 

genome 

According to a preferred aspect of the present 
invention, said constituent proteins are enzymes coded for 
by the HIV-pol gene and the polypeptide thus comprises a 
substantial portion of each of a plurality of enzymes 
selected from HIV-pol protease, HIV-pol reverse transcrip- 
tase, HIV-pol RNAse H and HIV-pol Integrase. Most prefer- 
ably, the polypeptide comprises substantial portions of all 
four of said enzymes. 

In vivo , the initial product of expressing the HIV-pol 
gene is cleaved into its individual elements by the 
protease. The active site for proteolytic activity occurs 
adjacent the NH 2 -terminus of the expression product, 
corresponding to the 5* -end of the protease gene. 
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According to a preferred aspect of the present 
invention, the polypeptide omits at least that part of the 
amino acid sequence of the HIV-pol protease gene which codes 
for the active site responsible for proteolytic activity. By 
5 omitting this portion, the integrity of the polypeptide is 
maintained and it is less liable to degrade. 

BRIEF D ESCRIPTION OF DRAWTWrzg 

Figure 1 is a schematic diagram showing the procedure of 
Example 1; 

10 Figure 2 shows the results of electrophoresis tests 

carried out in the manner explained in Example 2; and 

Figure 3 is a graph showing the results of the 
experiments carried out in Example 3. 

BEST MODE FOR PA RRYING OTTT THF. TWVENTTDM 

15 The HIV-pol gene of several strains of HIV-l has been 

cloned and the corresponding amino acid sequences derived 
from the determined DNA sequences. The amino acid sequences 
of ten strains appear in the accompanying Table l at the end 
of this disclosure. m Table 1, the full sequence of strain 

20 HIV HXB2 is given, whereas for the other nine strains, only 
sequence differences are listed. As used herein, the term 
"constituent protein coded for by the HIV-pol gene" refers 
to a protein having sufficient amino acid homology with the 
sequence of HIV HXB2 appearing in the accompanying Table so 

25 as to result in antibodies raised against the protein cross- 
reacting with a polypeptide consisting of the precise amino 
acid sequence of HIV HXB2. 

The HIV-pol gene can be expressed to produce the desired 
polypeptide by various techniques, e.g. some or all of the 

30 baculovirus techniques described in U.S. Patent 4,745,051 to 
Gale E. smith et al issued on May 17, 1988; Baculovirus 
Vectors for Expression of Foreign Genes by c. Yong Kang 
Advances in Virus Research, Vol. 35, pp 177-192, Academic 
Press Inc., 1988; A Manual of Methods for Baculovirus 

35 Vectors and Insect Cell Culture Procedures, Max D. Summers 
and Gale E. Smith, May 1987, Texas A&M University; and 
Baculoviruses as Gene Expression Vectors, Lois K. Miller, 
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Ann. Rev, Microbiol. 42., pp 177-1991; the disclosures of 
which are incorporated herein by reference. However our 
Canadian Patent Application Serial No. 591,908 filed on 23rd 
February 1989 (and equivalent British Patent Application 
5 Serial No. 89 04426.7 filed on February 27, 1989 and US 
Patent Application Serial No. 316,768 filed on February 28, 
1989) describes and claims an improved baculovirus 
expression system capable of producing foreign gene proteins 
at high levels and the use of this expression system is 

10 particularly preferred for expressing the polypeptide of the 
present invention. 

The process disclosed in our Canadian patent employs a 
recombinant baculovirus containing at least a major part of 
a polyhedrin gene promoter region, a transcription 

15 termination sequence of a polyhedrin structural gene, a 
foreign structural gene (e.g. an HIV-pol gene) having a 
translation start codon followed by coding sequences and a 
translation stop codon. The foreign gene is located between 
the promoter region and the termination sequence. 

20 Immediately upstream of the start codon there is a putative 
insect cell ribosome binding site for the polyhedrin gene 
effective for overcoming resistance of susceptible insect 
cells to express the foreign gene at a high level. The 
putative ribosome binding site comprises at least the final 

25 four nucleotides of the sequence 5 1 — ACCTATAAAT-3 1 . 

Example 3 of the Canadian application describes the 
production of the pol protein of HIV-1 in a baculovirus 
expression system based on Autoarapha californica 
nucleopolyhedrosis virus (ACNPV) and specifies that a 

3 0 recombinant baculovirus designated ACNPV-HIV-YK-pol has been 
deposited at the American Type Culture Collection of 123 01 
Parklawn Drive, Rockville MD 20852, USA under Accession No. 
ATCC VR 2233. Deposit was made on November 30, 1988. The 
disclosure of our Canadian Patent Application Serial No. 

35 591,908 is incorporated herein by reference. 

Utilising the procedures described in Example 3 of 
Canadian Patent Application Serial No. 591, 908, a polypeptide 
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comprising the protease, RNAse H and Integrase enzymes of 
HIV strain HIV-XB2 may be produced. 

The polypeptide can be used as a diagnostic reagent or 
vaccine component in ways known to persons skilled in the 
5 art, e.g. by the techniques indicated in the publication 
entitled Clinica, Testing for HIV and AIDS, The Next 
Five Years, George Street Publications Ltd. , Richmond, 
Surrey, UK, the disclosure of which is incorporated herein 
by reference. 

10 The invention is illustrated in more detail by the 

following Examples. Example 1 illustrates the production of 
a modified recombinant plasmid pUC18-Dpol3 having a 273 bp 
deletion at the 5 "-terminus and its expression as 
polypeptide lacking the first 91 amino acids at the 

15 NH 2 - terminus of the HIV-pol protease. Examples 2 and 3 relate 
to the expression of the polypeptide and its use as a 
diagnostic reagent. 

EXAMPLE 1 

Construction o f baculovirus transfer vector containing HIV-l 

20 Pol aene with 273 bo deletion at 5' terminng 

As illustrated in Figure 1, the Bglll and Sail fragment 
of plasmid pHXB-2D containing the HIV-l pol coding region 
was isolated and inserted into BamHI and Sail sites of 
pUC18. The resulting recombinant plasmid (pUCl8-Dpol 1) was 

25 cut with Sstl and dephosphorylated. A synthetic double- 
stranded crossover linker containing a Sstl cohesive end, a 
BamHI site, the putative insect Spodoptera frucriperda (SF9) 
cell ribosome binding site (P) and 15 nucleotides of the 
homology searching sequences which overlaps with the 5 1 

30 terminus of the pol gene was ligated at the Sstl site and 
transformed. The recombinant plasmid, (pUC18-Dpol 2) was 
isolated, digested with sPHl, dephosphorylated and ligated 
with another crossover linker DNA containing SphI cohesive 
end at the 3« terminus, BamHI site and 15 nucleotides of the 

35 homology searching sequences which recognise the 3 1 terminus 
of the pol gene. The resulting recombinant plasmid 
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(pUC18-Dpol 3) contains the putative SF9 cell ribosome 
binding site (P) followed with pol open reading frame 
starting with the first ATG (TI) codon (map unit 2357-2359) 
in the pol gene and the translation termination (TT) codon 
5 TAG (map unit 5093-5095) . This whole cassette was flanked 
with BamHl sites. The BamHl fragment was isolated and 
inserted into the BamHl site of the pAcYMl baculovirus 
transfer vector (pAcYMl-Dpol) . The pAcYMl-Dpol transfer 
vector DNA was used to co-transfect SF9 cells with wild type 
10 AcNPV DNA to isolated recombinant AcNPV HIV-YK pol virus. 

EXAMPLE 2 

Expression of pol gene pr oducts by recombinant baculovirusss 
Recombinant AcNPV-HIVWHpol contains an insert comprising 
essentially the whole DNA sequence of the HIV-pol gene (see 

15 Table 2 at the end of the present disclosure) . When 

expressed, the resulting full length gene product of the 
HIV-pol gene is "processed", i.e. the proteolytic active 
site of the HIV pol protease gene cleaves the protein into 
66 kD, 51 kD and 32 kD fragments. 

20 By way of comparison, recombinant AcNPV-HIVYKpol (see 

Table 3 at the end of the present disclosure) omits 
NH 2 -terminal amino acid sequences containing the proteolytic 
active site of the HIV-pol protease. When expressed, the 
resulting gene product is not "processed", i.e. the - 95 kD 

25 protein remains intact. 

The following experiments illustrate this. 
Uninfected S. fruaip erria (SF9) cells, or SF9 cell 
infected with recombinant baculoviruses AcNPV-HIVWHpol , 
AcNPV-HIVYKpol or with wild-type AcNPV, were harvested after 

3 0 72 hours of infection. Lysates of the infected or uninfected 
cells were electrophoresed in a 12% polyacrylamide Laemmli 
gel and proteins are identified by either Coomassie blue 
staining (S) or Western blot analyses (W) using the standard 
HIH HIV positive immunoglobulin. As shown in Figure 2, lanes 

35 1, 2 and 3 represents the lysates of AcNPV-HIVYHpol 

recombinant virus infected cells, lanes 4, 5 and 6 represent 
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the lysates of AcNPV-HIVYKpol recombinant virus infected 
cells, lane 7 shows the wild-type AcNPV infected cell 
lysate, lane 8 shows uninfected cell lysate and lane 9 shows 
molecular weight markers. Lane 3 and 6 show the whole cell 
5 lysate, lanes 2 and 5 show proteins in the infected cell 
nuclei and lanes 1 and 4 show proteins in the infected cell 
cytoplasm. P denotes polyhedrin protein and arrows show 95K 
Dal uncleaved pol gene product representing 91 amino acid 
deletion of protease produced by AcNPV-HIVYKpol virus and 
10 66K Dal, 51 K Dal and 33K Dal processed pol gene products in 
AcNPV-HIVYHpol virus infected cells. 

EXAMPLE 3 

A. Production of pol gene produrrt- 

Recombinant ACNPV-HIVYKpol virus infected Soodont^, 
15 fruqiperda (SF9) cells were harvested 4 days after 

infection. Nuclei of infected cells containing most of the 
pol gene product were isolated by treating the infected 
cells with 0.1% Triton X-100 and 0.5% NP40 on ice for 20 
minutes followed by centrifugation at 750 g for 10 minutes. 
20 The pelleted nuclei were denatured with 1% SDS in TRIS-HC1 
PH 8.0 at room temperature for 3 0 minutes. The cellular DNAs 
were removed by ethanol precipitation using 2 volumes of 
100% ethanol. The SDS in the solution were removed by add- 
ition of 25 mM KCL incubated at 4'C for 30 minutes' followed 
25 by centrifugation at 12,700 g for 15 minutes. The -pol gene 
product in the supernatant was used for anti-pol ELISA. 

B. Detection of Kiv antibodies h Y F TTg& 

The pol antigen was diluted in PBS and dispensed in a 
microtiter plate (Nunc cat 269620). The concentration of pol 
3 0 to coat plates was determined empirically on the strength of 
bands on polya cry 1 amide gels. 

The concentration of pol necessary to coat one well was 
between 1 and 10 *tg. 

The plate was covered and incubated at 4*c. The time of 
35 incubation varied between 12 and 24 hrs without no apparent 
differences in reactivity. 
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The plates were then washed three times in PBS tween 20 
employing a Skatron plate washer. 

Various standards, NIH HIV+ immunoglobulin (NTH STD) , 
pool HIV+ plasma (PAT STD) and plasma from non-infected 
5 individuals (NS) were employed. The standards were diluted 
beginning at 1:200 for NIH STD, and 1:10 for PAT STD and NS. 
Unknowns were tested usually at 1:50 but dilutions as high 
as 1:10 can be employed. 

All samples were inactivated before testing. Normal sera 

10 were processed in the same fashion as sera from AIDS 
patients. The inactivation was performed with 
4 • -aminoethyltrioxsalen- hydrochloride (AMT) from Lee 
Biomolecular Research Inc. (San Diego, California cat 231) 
and an ultra violet light trans-illuminator (Spectroline 

15 model TC-365, Fisher Scientific Ottawa Ont.). The AMT was 
reconstituted in 50% ethanol at 1 ng/ml. The sera was 
aliquoted in Eppendorf tubes and for every 100 nl of serum 
or plasma, 10 of AMT was added to the sample. The 
samples were layed in the transilluminator and irradiated 

20 for 5 minutes. An additional 10 fil of AMT was added to the 
sample and the samples were irradiated for a further 5 
minutes. The samples were inactivated by this procedure. 

The incubation time of the human-anti-pol was 30 to 40 
minutes at room temperature (23 'C) (the time of incubation 

25 found to be quite critical) . Therefore, all dilutions of 

standards (negative and positive) and unknowns was performed 
in a separate plate. Once all dilutions were done, the 
dilutions (100 /il) were transferred to the ELISA plate 
coated with pol employing a multichannel pipettor. All 

30 dilutions were with PBS Tween 20 (0.1%). 

The state of the serum or plasma sample was found to be 
important. Samples repeatedly frozen and thawed usually gave 
higher backgrounds. This was especially evident with samples 
from normal individuals. 

35 The plates were washed three times in PBS-Tween 20 after 

the 30 minute incubation with the first antibody. 
A Skatron II plate washer was employed for this purpose. 
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The second antibody used (goat anti-human Ig linked to 
horse radish peroxidase) was an affinity purified reagent 
obtained from Tago Diagnostics (Inter Medico To DNT cat 
2393). An appropriate dilution was determined experimen- 
5 tally (approximately 1:2,000) is made in PBS-Tween 20 

(0.1%). 100 nl was dispensed into the wells except for one 
which will be employed as a blank for the plate reader. The 
plate was incubated for 1 hour at room temperature. 

The plates were washed three times with PBS-Tween 20 
10 employing the Skatron II plate washer. 

Freshly prepared substrate (100 fil) was added to the 
wells and after 20 minutes the reaction stopped with the 
addition of 100 fil of 0.07M H2SD4. 

The plate was read at 450 nm in the BIOTEK BL/310 ELISA 
15 plate reader. A hard copy of the data was obtained from the 
reader and the data also stored directly onto computer 
diskette for further processing by the Anelisar program. 

Additionally, controls were also performed on each 
plate. In two or three wells no serum or plasma was added. 
20 in one well no primary or secondary antibodies were added 
but substrate was. This well was employed to blank the ELISA 
plate reader. The remaining wells were employed to determine 
the extent of binding of the secondary antibody (Goat 
anti-HIg-HRPO) to POL. Thus, these wells received no primary 
25 antibody but secondary antibody and substrate with the 

appropriate washes in between each incubation. Usually the 
value of this latter control is below 0.1000 OD. 
The results are shown in Figure 3. 

The following materials were use for the anti-pol ELISA 
30 procedure 

Buffers 

Phosphate Buffered Saline fPRg) 

Na 2 HP0 4 (dibasic anhydrous) 13.6 g 

NaH 2 P0 4 (monobasic) 2.4 g 

35 Nacl 90.0 g 

Salts are dissolved in 8 litres of distilled deionized 
water and pH is adjusted to 7.2 with NaOH or HC1. This 
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buffer is employed as coating buffer, diluent and washing 
buffer. The latter two buffers are modified as indicated 
below. 

Diluent for primary and sec ondary antibodies and washing 
5 buffer 

PBS +0.1% Tween 20 (Sigma, St. Louis MO) (0.1 ml Tween 
20 + 100 ml PBS) . The diluent buffer is made up daily. 
Substrate buffer 

Equal volumes of 0.1M Na 2 HP0 4 ( 0.7 09 g/50 ml) and 0.1M 
10 citric acid (0.960 g/50 ml). The pH is adjusted to 4.0 with 
NaOH or HC1. The substrate buffer is made up weekly. 
Substrate 

A tablet (2 mg) of o-phenylenediamine (Sigma cat. P6787) 
is dissolved into 10 ml of substrate buffer. Hydrogen 

15 peroxide (4 fil of 30%) is added to the solution just prior 
to plating. The solution should be kept in the dark as much 
as possible. 
Stopping reagent 

The enzymatic reaction is stopped with 0. 07M H,SO,. 

20 It is a particularly advantageous feature of the 

polypeptides, the use of which is described herein, that 
they cross-react with antibodies against diverse strains of 
HIV. Thus, for example, the polypeptides described herein 
based on HIV-1 can cross-react with antibodies raised 

25 against various strains of HIV-l and HIV-2. Thus they may be 
used in diagnostic kits for detecting either virus category. 
Similarly, in vaccines they can provide broad-spectrum 
protection. 

Industrial Applicability 
30 As will be apparent from the above, the present 

invention can be used in the medical field for testing for 
HIV infection and for immunizing against HIV infection, as 
well as for other diagnostic or prognostic purposes. 
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TABLE 1 



HIV-l pol protein sequence of HIVHXB2 virus 
Data from Human Retroviruses and AIDS 1988 
Los Alanos National Laooratorv 

AcNPV-HlVWHpol 

Hr7IIXB2Met£h^heArgGluAspLeuAloPhoLeuGlnGlvLysAlaArgGluPheSerSerGlu 

HIVBH1B2 -Jfe ; ■ ; * 

——Bin 



HIVBH5 
HIVPV22 
HIVBRU 
HIVMN 
HIVSF2 
HIVRF 
HIVMAL 
HIVELI 



■Gin 



— Asn Pro- 




Gly Leu ProLys— 



19 
20 
20 
20 
20 
0 
19 
19 
19 
19 



KIVHXB2 

HIVBH1 02 ThrArgAlaAsnSerPpoThrlleSerSerGlu 
HIYBH5 
HIVPV22 
HIVBRU 
HIVMN 
HIVSF2 
HIVRF 
HIVMAL 
HIVELI 



GlnThrArgAlaAsnSerProThrArg 



ThrArgAlaAsnSerProThrlleSerSerGlu- 
ThrArgAlaAsnSerProThrlleSerSerGlu- 
ThrArgAlaAsnSerProThrlleSerSerGlu- 



— Ser 
— Ser 



28 
40 
40 
40 
40 
0 
28 
28 
28 
28 



HIVHXB2 

HIVBH102 

HIVBH5 

HIVPV22 

HIVBRU 

HIVMN 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI 



ArgGluLeuGlnValTrpGlyArgAspAsnAsnSerProSerGluAlaGlyAlaAspArg 



— — GlyGlu Leu 

Gly . . .LysThrLeu Thi Glu- 

• .ProLeu LysThr Glu- 



48 
60 
60 
60 
60 
0 
48 
47 
47 
47 



HIVHXB2 

HIVBH1 02 

HIVBH5 

HIVPV22 

HIVBRU 

HIVMN 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI 



GlnGlyThrValSerPheAsnPheProGlnVolThrLeuTrpGlnArgProLeuValThr 

-lie- 
— He- 
-He— 



Ile- 



— Ser- 
-Ser- 



He— 

lie- 

— Ile- 
— He- 



Vol- 



-Ala 



68 
80 
80 
80 
80 
0 
68 
67 
67 
67 
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Table 1 cont'd 



HIVHXB2 

HIVBH102 

HIVBH5 

HIVPV22 

HIVBRU 

HIVMN 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI 



HIVHXB2 

HIVBH1 02 

HIVBH5 

HIVPV22 

HIVBRU 

HIVMN 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI 



<- gag cds end 

IleLysIleGlyGlyGlnLeuLysGluAlaLouLeuAspThrGlyAloAspAspThrVal 



Arg 

Vol 

VolArgVal- 



AcNPV-HIVYKpol starts 
LeuGluGluVtetS^rLeuProGlyArgTrpLysProLysMetlleGlyGlylleGlyGly 



• • • • — — Asiv 

Asn- 

— Asn- 

IleAsn- 

Asn- 



-Arg- 



-Lys- 
-Lys- 
-Lys- 
-Lys- 



88 
100 
100 
100 
100 
0 
88 
87 
87 
87 



108 
120 
120 
120 
120 
17 
108 
107 
107 
107 



HIVHXB2 

HIVBH1 02 

HIVBH5 

HIVPV22 

HIVBRU 

HIVMN 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI 



PhelleLysValArgGlnTyrAspGlnlleLeuIleGluIleCysGlyHisLysAlalle 



-Thi Gly- 

-ProVal 



128 
140 
140 
140 
140 
37 
128 
127 
127 
127 



HIVHXB2 GlyThrVQlLeuValGlyProThrProValAsnllelleGlyArgAsnLeuLeuThrGln 

HIVBH102 

HIVBH5 ZZZ ZZ 

HIVPV22 „ 

HIVBRU I~-ZIZI_~ 

HIVMN ~IIZ~~~ 

HIVSF2 

HIVRF - . 

HIVELI """" 



148 
160 
160 
160 
160 
57 
148 
147 
147 
147 
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Table 1 cont'd 



\/ p66, p51 

SJm? 02 ?JJ^ 1 60 

HIVBH5 ZZZZZZZZZZZ Z~Z~Z™ 180 

HIYPV22 ZZZZZZZ ' 180 

hivbru ZZZZZZ ZZZZZZZ I 180 

HIVMN Leu ZZZZZZZZZ ZZ Z "~ 180 

HIVSF2 ~ 2 "II _ 77 

HIVRF ~ IZZIZZ" 168 

HIVELI 1 " 1 67 

167 

h^bh?02 i as 

HivBHs ZZZ ZZZZZ ZZ — ' — 200 

hivpv22 ZZZ ZZZZ " 200 

HIVBRU ZZZZZZZZ ZZZ Z " 200 

hivmn Z—ZZZZZZ Z ' — 200 

hivsf2 ZZZZZZZZZ ZZ ; — 97 

HIVMAL ArgZZZZZZZZ 187 

hiveli ZZZZZZZZZZZZZZ" z 187 

SIm«2 ^11^ 208 

hivbh5 ZZZZZZZZZZZZZZZZZ Z 220 

HIVPV22 ZZZZ „ 220 

hivbru ZZZZZZZZZZZ—ZZ ~~ " — ~ — 220 

hivmn — ii© ZZZZZ Z ZZ Z_ ' 220 

hivrf ZZZZZZ : — 5 — 208 

HIVMAL Thr LysAsp ZZZZZZZZZZZ , " 207 

HIVELI —Thr I!^p ZZ LeU 7 207 

A5 P Arg 2fl7 

EJ™2 *fZ!Z!llZ*!^^ 228 

hivbhs zzzzzzzz _zzzzz~ 2I *° 

hivpv22 ZZZZ_ZZZZZ _ ' — 2 ^ 0 

hivbru ZZZZZZ. Z ZZZZ_ ' 2f|0 

HIVMN ZZZ ZZ 2I * 0 

hivsf2 zzzzzzzz „zz_ Z. ' — 137 

hivrf ZZZZZZ Z Z Z -7ZZZZ — 228 

HIVELI Ile-ZZZZ"ZZ " 227 

227 
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Table 1 cont'd 

* 



HIVHXB2 LeuValAspPheArgGluLeuAsnLysArgThrGlnAspPheTrpGluValGlnLeuGlY 248 

HIVBH102 260 

HIVBH5 Arg ; 260 

HIVPV22 260 

HXVBRU 260 

HIVMN Lys 1 57 

HIVSF2 248 

HIVRF ■ 247 

HIVMAL Asn 247 

HIVELI 247 

HIVHXB2 IloProHisProAlaGlyLeuLysLysLysLysSerValThrValLeuAspValGlyAsp 268 

HIVBH102 ^ 280 

HIVBH5 280 

HIVPV22 280 

HIVBRU 280 

HIVMN 177 

HIVSF2 — 268 

HIVRF 267 

HIVMAL 267 

HIVELI 267 

HIVHXB2 AlaTyrPheSerValProLeuAspGluAspPheArgLysTyrThrAlaPheThrllePro 288 

HIVBH102 • 3O0 

HIVBH5 300 

HIVPV22 300 

HIVBRU ■ 300 

HIVMN Lys T 97 

HIVSF2 Lys 288 

HIVRF LysGlu 287 

HIVMAL 287 

HIVELI „Ser 287 

HIVHXB2 SerlleAsnAsnGluThrProGlylleArgTyrGlnTyrAsnValLeuProGlnGlyTrp 308 

HIVBHT 02 320 

HIVBH5 SerGly , 320 

HIVPV22 320 



HIVBRU 

HIVMN 

HIVSF2 



320 
217 
308 



HIVRF Arg 3C7 

HIVMAL 307 

HIVELI 307 
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Table 1 cont'd 



HIVHXB2 

HIVBH102 

HIYBH5 

HIVPV22 

HIVBRU 

HIVMN 

HIVSF2 

HIVRF 

HIVMAL 

HIYELI 



HIVHXB2 
HIVBH102 
HIYBH5 
HIYPV22 
HIVBRU 
HIVMN 
HIVSF2 



LysGlySerProAlallePheGlnSerSerMotThrLysIleLeuGluProPheArgLys 328 

— ^ — ^ 



340 
340 

340 

237 

328 

-Lys 327 



-Thr 327 
327 



!™f"™^ 348 



HIVBH5 
HIVPV22 



HIVMAL 
HIVELI 



HIVBH5 Phe- 

HIVPV22 

HIVBRU 



360 
360 
360 
257 



HIVRF Glu Z L. _~ _~ ' 348 

HIVMAL Lys Glu Z ~ ZZZZZ *** 7 

HIVELI GluMet Z~~ ~ 



347 
347 



SzvBmn !^ff*I^ 368 



380 



HIVBRU ~ 380 

hivmn aiq — ""~"~:::_ Ar \*% 

HIVSF2 Ar9 277 



HIVRF Ile ~_ 267 



■Glu -Lys 367 



367 



SJIESL ^l U J^^^ 388 



400 
400 
400 



HIVMN Phe ~Z1ZZ 400 

HIVSF2 Phe I JZ 297 

HIVRF Phe ZJZZ I ~ "~ 388 

HIVMAL Phe ~ JZ ? 387 

HIYELI Phe Arg Z ZZZZ " ~ " 



387 
387 
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Table 1 cont'd 

— — — y 



HIVHXB2 LeuHisProAspLysTrpThrValGlnProIleValLeuProGluLysAspSerTrpThr 408 

HIVBH102 420 

HIVBH5 II© 420 

HIVPV22 420 

HIVBRU 420 

HIVMN 317 

HIVSF2 Met 408 

HIVRF 407 

HIVMAL Gin Asp Glu 407 



HIYHXB2 ValAsnAspIleGlnLysLeuValGlyLysLeuAsnTrpAlaSerGlnlleTyrProGly 428 

HIVBH102 440 

HIVBH5 443 

HIVPV22 440 

HIVBRU A 40 

HIVMN Ala 337 

HIVSF2 Ala 4 28 

HIVRF Ala 427 

HIVMAL 427 

HIVELI —Asn GluArg ~ 427 



HIVHXB2 IleLysValArgGlnLeuCysLysLeuLeuArgGlyThrLysAlaLeuThrGluVallle 448 

HIVBH102 . 4B0 

HIVBH5 460 

HIVPV22 460 

HIVBRU 460 

HIVMN Lys 357 

HIVSF2 Lys * 448 

HIVRF Lys . Vol 447 

HIVMAL Lys Alo AspIloVal 447 

HIVELI . 447 



HIVHXB2 ProLeuThrGluGluAlaGluLeuGluLeuAlaGluAsnArgGluIleLeuLysGluPro 468 

HIVBH102 4 S 0 

HIVBH5 . 430 

HIVPV22 

HIVBRU _ 

HIVMN 

HIVSF2 



HIVMAL AlQ- 

HIVELI 



480 
480 
377 
468 



467 
467 
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Table 1 cont'd 



ut»b!! B «o ValHisGlyVQlTyrTyrAs P ProSerl -VsAspLeuIleAlaGluIleGlnLysGlnGlv 

H 1 VHH1 (- 1 ^_ _ . , . * 



HIVBH102 
HIVBH5 



HIVBRU 



HIVMN 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI 



488 
500 
500 
500 
500 
397 
488 
487 
487 
487 



SISSw f lnGlyGlnTrpThrTyrGl ^ 



HIVBH102 

HIVBH5 

HIVPV22 

HIVBRU 

HIVMN 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI His 



— GlnTyr— 



508 
520 
520 
520 
520 
417 
508 
507 
507 
507 



HIVBH102 IZttlt^ 

HIVBH5 ZZZZZZZZZZZZZZZZ ZZ~ZZZZ 

HIVPV22 ZZZZZ«Z~ZZ Z ~_ — ~ 

HIVBRU Thr ZZZZZZZ~_~Z_ Z_Z 

HIVMN Z_~_~ " 

HIVSF2 ^ Z.ZZZ-Z"Z~Z_~ _Z_ ~ 

hivrf ZZZZZZZ ~ _zz _ — 

HIVMAL IleLysSer ZZZZZZZZZZZZZZZZ" ~_Z 

HIVELI _ZZZ~ A1 



528 
540 
540 
540 
540 
437 
528 
527 
527 
527 



HIVBH5 
HIVPV22 
HIVBRU 
HIVMN 



HIVSF2 ValSer 

HIVRF 
HIVMAL 

HIVELI Arg Sei 



VolAla 

AlaGln 



He 



Arg— 



Arg— 



548 
560 
560 
560 
560 
457 
548 
547 
547 
547 
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Table 1 cont'd 



HIVHXB2 GlnLysGluThrTrpGluThrTrpTrpThrGluTyrTrpGlnAlaThrTrpIleProGlu 568 
HIVBH102 300 



HIVBH5 

HIVPV22 Z'. 

HIVBRU 

HIVMN Thr +++ i 

HIVSF2* Ala Met 

HIVRF Ala 

HIVMAL Ala 

Ala 



HIVELI 



HIVSF2 
HIVRF 
HIVMAL 
HIVELI 



HIVELI lie- 



HIVHXB2 
HIVBH1 02 
HIVBH5 
HIVPV22 
HIVBRU 
HIVMN 
HIVSF2 
HIVRF 
HIVMAL 
.HIVELI 



-Ser— 



Asp SerlleAl 



"~"ZZ — j sp Ser — 



580 
580 
580 
580 
477 
568 
567 
567 
567 



™u!L TrpGluPheValAsnThrProProLeuVa lLysLeuTrpTyrGlnLeuGluLv S GluPro 588 
HXVBHl 02 "* —— *"~' — "™~- — — —————— —————— ————— 



HIVBH5 _ 

HIVPV22 ~~I-ZI.~~~ 

HIVBRU ~~~ 

HIVMN Vol ~~ 600 



600 
600 
600 



497 

588 

587 

Thr 587 



587 



IleValGlyAlaGluThrPheTyrValAspGlyAlaAlaAsnArgGluThrLysLeuGly 608 



HIVHXB2 
HIVBH102 

HIVBH5 ~~ I™ " 620 

620 

-Arg 620 

Ser- 



-Ser 

HIVPV22 

HIVBRU 

HIVMN 

HIVSF2 

HIVRF He 

HIVMAL " 607 

— Lys 607 



620 

Lys 517 

608 



607 



LysAlaGlyTyrValThrAsnArgGlyArgGlnLysValValThrLeuThrAspThrThr 628 

" L y s Pro Asn 640 

" ' His 640 

~"""""~!:!"Z""^ ys "" Pro Asn 

640 

537 



a 628 

627 
627 
627 
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Table 1 cont'd 



SvBH102 Tltl^ 648 

HIVBH5 "~~His-"~~~ ~ 660 

HIVPV22 Z_Z_Z~ZZ " 660 

HIVBRU --Hil ZZ " 660 

HIVMN ZZZZ- H isZZ ZZ " 660 

HIYSF2 --His-"--— ZZZZ Z ' 557 

HIVMAL ZZZZZZZZZZZ u^l Gfy7 

HIVELI ZZ^nZZZZZZZZZZZZZZZZ ^ZZ 647 

H^Bm02 ^^^^ BBS 

hivbh5 ZZZZZZ_ZZZZZZZZZZZZZZ ^ — Lys ~ — 680 

HIVPV22 *~ys 680 

HIVBRU ZZZZZ ~ 680 

HIVMN ZZZZZ LyS 680 

HIVSF2 ZZZZZ LyS 577 

HIVRF ZZZZZZZZZ Z L Y S 668 

HIVMAL ZZZZZZ" ZZZ Z_ ~~ Lys 667 

hiveli ZZZZZZZZZZZ — Lys — 667 

hIIZw llTflt^ BBS 

HIVBH5 ZZ.ZZZZ ~ ' 700 

HIVPV22 ZZZZZZZZZZ. Z — 700 

hivbru ZZZZZZZZZZ Z Gln 700 

hivmn Ser ZZZZZZZZZ ZZZZ ~ — 700 

hivsf2 Ser ZZZZZZZZZZ "Z — — 597 

HIVRF Ser ZZ ZZ 688 

HIVMAL He ZZ " ~~ 687 

hiveli t ZZZZZZZZZZZ" ZZ Gln — Asp — " 687 

!ra™K tllllTll:^^^^^^ 7 0 8 

HIVBH5 ZZZZZZ Z Z_ ~~ 720 

HIVPV22 ZZZZ 720 

hivbru ZZZZZZZZZZZZ * ~ — 720 

hivmn ZZZZZZZZZZZ " — 720 

hivsf2 ZZZZZ ZZZ Z™ ~ — 617 

HIVMAL Ser ZZZZZZZZZZZZZZ~ZZZ ~ Ar 9"~"^ — 7 07 

HIVELI ZZ^_~ Z Z Z" — - — — 707 
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Table 1 cont'd 



HIVHXB2 AlaGlylleArgLysValLeuPheLeuAspGlylleAspLysAlaGlnAspGluHisGlu 728 



HIVBH102 lie- 



HIVPV22 Ile- 

HIVBRU 
HIVMN 



740 



HIVBH5 He — Glu 740 



740 
740 



~ GluAsp 637 

HIVSF2 Asn Glu • 728 

HIVRF Thi 727 

HIVMAL Ser Glu 727 

HIVELI Gin Glu 727 



HIVHXB2 LysTyrHisSerAsnTrpArgAlaMetAlaSerAspPheAsnLeuProProValValAla 748 

HIVBH102 , 760 

760 

760 

760 

„Xle 657 



HIVBH5 
HIVPV22 
HIVBRU 
HIVMN 
HIVSF2 

HIVRF 

HIVMAL Ile 

HIVELI —Asn 



748 
747 
747 
747 



HIVHXB2 LysGluIleValAlaSerCysAspLysCysGlnLeuLysGlyGluAlaMetHisGlyGln 768 
HIVBH102 — 



HIVBH5 

HIVPV22 

HIVBRU 

HIVMN 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI 



780 
780 
780 
7O0 
677 
768 
767 
767 
767 



HIVHXB2 ValAspCysSerProGlylleTrpGlnLeuAspCysThrHisLeuGluGlyLysVallle 788 
HIVBH102 



HIVBH5 
HIVPV22 
HIVBRU 
HIVMN 
HIVSF2 
HIVRF 



800 

800 

800 

800 

697 

lie 788 



He 787 

HIVMAL Ile 7Q7 

HIVELI _ 787 
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Table 1 cont'd 



HIVBH5 ZZZZZZZZZ II ~ 

HIVPV22 ~ ~~~~ _ 

HIVBRU ~~~~~ 

hivsf2 ~ zzzzzzzzzzz z~z — 

HIVRF Z ™" " — 

HIVMAL lie IZZ" I * — 

HIVELI ~ 



808 
820 
32 0 
820 
820 
717 
808 
807 
807 
807 



HIVHXB2 

HIVBH102 

HIVBH5 

HIVPV22 

HIVBRU 

HIVMN 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI 




Valval 
-Valval 



828 
84 0 
840 
840 
840 
737 
828 
827 
827 
827 



HIVHXB2 

HIVBH102 

HIVBH5 

HIVPV22 

HIVBRU 

HIVMN 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI 



HisThrAspAsnGlySerAsnPheThrGlyAlaThrValArgAlaAlaCysTrpTr 
~~ ~ Ser Lys 

" ~ Ser Lys 

SerThr Lys _ _ 

Pro SerThr Lys 

SerThr Lys— 

SerThr Lys ■ 

"~ Ser Ala Lys 

Ser Ala Lys — 



pAla 



■Thr 



848 
860 
860 
860 
860 
757 
848 
847 
847 
847 



HIVHXB2 

HIVBH102 

HIVBH5 

HIVPV22 

HIVBRU 

HIVMN 

HIVSF2 

HIVRF 

HIVMAL 

HIVELI 



GlylleLysGlnGluPheGlylleProTyrAsnProGlnSerGlnGlyValValGluSer 



Ile- 



Asn- 



868 
880 
880 
880 
880 
777 
868 
867 
867 
867 
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Table 1 cont'd 



HIVHXB2 MetAsnLysGluLeuLysLysIlelleGlyGlnValArgAspGlnAlaGluHisLeuLys 088 

HIVBH102 900 

HIVBH5 900 

HIVPV22 900 

HIVBRU 900 

HIVMN 797 

HIVSF2 Asn 888 

HIVRF — Gin Gin 887 

HIVMAL Glu 887 

HIVELI 887 

HIVHXB2 ThrAlaValGlnMetAlaVolPhelleHisAsnPheLysArgLysGlyGlylleGlyGly 908 

HIVBH102 920 

HIVBH5 ; 920 

HIVPV22 • 920 

HIVBRU ■ 920 

HIVMN Arg 817 

HIVSF2 — 908 

HIVRF 907 

HIVMAL 907 

HIVELI ArgArg 907 

HIVHXB2 TyrSerAlaGlyGluArglleValAspIlelleAloThrAspIleGlnThrLysGluLeu 928 

HIVBH102 940 

HIVBH5 940 

HIVPV22 940 

HIVBRU 9^0 

HIVMN Gly 837 

HIVSF2 928 

HIVRF 927 

HIVMAL He Met 927 

HIVELI He 927 

HIVHXB2 GlnLysGlnlleThrLysIleGlnAsnPheArgValTyrTyrArgAspSerArgAsnSer 948 

HIVBH102 Pro 960 

HIVBH5 Pro 960 

HIVPV22 Pro 960 

HIVMN AspPro 857 

HIVSF2 AsnLysAspPro 948 

HIVRF < -AspPro 947 

HIVMAL ■ Asn AspPro 947 
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Table 1 cont'd 



HIVBH102 tlll7y^ 968 



980 
980 
980 
980 
877 
968 



HIVBH5 

HIVPV22 

HIVBRU 

HIVMN 

HIVSF2 

HIVRF His _ 

HIVMAL He ~_ 967 

HIVELI He Z 967 

967 



£™J 2 *z"?!:*:!^^ *** 

HIVBH5 ~ " 1 000 

HIVPV22 ZZJZ ~ JZ 1000 

HIVBRU Z 1000 

HIVMN Asn ~ ~I 1000 



HIVSF2 
HIVRF 



897 
988 



HIVMAL ~ 987 

HIVELI Lys ZZZZJZ 



Vol 



987 
987 



GlnMetAlQG1 y As P As P C y sVQ lAlaSerArgGlnAspGluA S p +++ i 00 * 

1016 
1016 
1 01 6 
1016 
913 
1004 

1003 

GlyGly 1003 



HIVBH5 

HIVPV22 

HIVBRU 



HIVMN Thr _ 

HIVSF2 Z ~~ZZ. 

HIVRF ~~~ZI_Z Z'ZZ 

HIVMAL 

HIVELI 



1003 
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TABLE 2 

HIV-l pol gene HIVHXB2 Sequence 

Data from Human Retroviruses and AIDS, 1988 

Los Alamos National Laboratory 

AcNPV-HIVW Hpol Virus 

* RBS 

Bam HI ti 
5 1 -GGATCCTATAAATATG tttttta gggaagatct 
pol cds start (NH2-terminus uncertain) -> 



2101 


ggccttccta 


caagggaagg 


ccagggaatt ttcttcagag 


cagaccagag 


ccaacagccc 


2161 


caccagaaga 


gagcttcagg 


tctggggtag 


agacaacaac 


tccccctcag 


aagcaggagc 


2221 


cgatagacaa 


ggaactgtat 


cctttaactt 


ccctcaggtc 


actctttggc 


aacgacccct 


2281 


cgtcacaaTA Aagatagggg ggcaactaaa 
<- gag cds end 


ggaagctcta 


ttagatacag 


gagcagatga 


2341 


tacagtatta* 


□aaaaaataa 


gtttgccagg 


aagatggaaa 


ccaaaaatga 


■tagggggaat 


2401 


tggagatttt 


atcaaagtaa 


gacagtatga 


tcagatactc 


Dtagaaatct 


gtggacataa 


2461 


agctataggt 


acagtattaq 


taggacctac 


acctgtcaac 


atoattggaa 


gaaatctgtt 


2521 


gactcagatt 


ggttgcactt 


taaattttcc 


cattagccct 


attgagactg 


taccagtaaa 


2581 


attaaagcca 


ggaatggatg 


gcccaaaagt taaacaatgg 


ccattgacag 


aagaaaaaat 


2641 


aaaagcatta 


gtagaoattt 


gtacagagat 


ggaaaaggaa 


gggaaaattt 


caaaaattgg 


2701 


gcctgaaaat 


ccatacaata 


ctccagtatt tgccataaag 


aaaaaagaca 


gtactaaatg 


2761 


gagaaaatta 


gtagatttca 


gagaacttaa 


taagagaact 


cooaacttct 

■mw mm mm m^ m* mr W 


nr^n +* +■ r» m 

yyy uu y i-i#cq 


2821 


attaggaata 


ccacatcccg 


cagggttaaa 


aaagaaaaaa 


tcagtaacag 


tactggatgt 


2881 


gggtgatgca 


tatttttcag 


ttcccttaga 


tgaagacttc 


aggaogtata 


ctgcatttac 


2941 


catacctagt 


ataaacaatg 


agacaccagg 


gattagatat 


cagtacaatg 


tgcttccaca 


3001 


gggatggaaa 


ggatcaccag 


caatattcca 


aagtagcatg 


acoaaaatct 


tagagccttt 


3061 


tagaaaacaa 


aatccagaca 


tagttatcta 


tcaatacatg 


gatgatttgt 


atgtaggatc 


3121 


tgacttagaa 


atagggcagc 


atagaacaaa 


□atagaggag 


ctgagocaoc 


atctgttgag 


3181 


gtggggactt 


accacaccag 


acaaaaaaca 


tcagaaagaa 


cctccattcc 


tttggatggg 


3241 


ttatgaactc 


catcctgata 


aatggacagt acagcctata 


gtgctgccag 


oaaaagacag 


3301 


ctggactgtc 


aatgacatac 


agaagttagt 


ggggaaattg 


aattgggcaa 


gtcagattta 


3361 


cccagggatt 


aaagtaaggc 


aattatgtaa 


actccttaga 


ggaaccaaag 


cactaacaga 


3421 


agtaatacca 


ctaacagaag 


aagcogagct 


agaactggca 


gaaaocagag 


agattctaaa 
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Table 2 cont'd 



3481 agaaccogto catggagtgt attotgaccc atcooaogac ttoatogcag aaatacogaa 

3541 gcoggggcaa ggccaatgga catatcaaat ttotcaagag ccatttaaaa atctgoaaoc 

3601 oggaaaotat gcaagaatga ggggtgccca cactaatgat gtaaoacaat taacagoggc 

3661 agtgcoaaaa otaoccacag aoagcatagt aatatgggga aagactccto aatttoaoct 

3721 gcccatacaa aoggaoacat: gggoaacatg gtggacagag tattggcaog ccacetggat 

3781 tcctgogtgg gagtttgtto atacccctcc cttagtgoaa ttatggtacc agttagagaa 

3841 agaacccata gtoggagcag oaoccttcta tgtagatggg gcagctaaca gggagactoa 

3901 attaggaaoa gcaggatotg tta ctootog oggaagacaa aaogttgtca ccctaactga 

3961 cacaacaaat cagaagactg ogttacaagc aatttatcta gctttgcagg attcgggatt 

4021 agaagtoaoc atagtaacag actcacaata tgcottagga atcattcaag cocaoccago 

4081 tcaaagtgaa tcagogttag tcootcoaat aatagogcog ttaatoaoaa oggaaoaggt 

4141 ctatetggca tgggtaccag cocacaaagg oattggagga oatgoacaag tagatoaott 

4201 agtcagtgct ggaateogga aagtactott tttagatggo atogotaogg cccaagatga 

4261 ocatgagaaa tatcocagta attggogagc aatggctagt gattttaacc tgccacctgt 

4321 ogtogcaaaa goaotogtog ccagctgtgo toaatgtcag ctaaaaggag oagccatgca 

4381 tggacaagta gactgtogtc caggaatatg gcaactagat tgtacacatt togaaggaao 

4441 agttatcctg gtagcagttc atgtogccag tggatatata gaagcagaag ttattccogc 

4501 agaaacoggg coggaaacag cotattttct tttoooatta gcoggaagat ggccagtooa 

4561 aaccatacat aetgacaotg gcogeaattt caccggtgct acggttaggg ccgcctgttg 

4621 gtgggcggga atcaagcagg aatttggaot tccctacaat ccccaaagtc aoggagtagt 

4681 agaatctatg aatoaagaat taaogoaaat tataggoeag gtoogogatc aggctgaaca 

4741 tcttaogoca gcagtococra tggcagtatt catccacoat tttoaaagaa aaggggggat 

4801 tggggggtac agtgcogggg aaagaatogt agacatoata gcaacogaca tacoaoctoa 

4861 agaattacaa aaacoaatta caaoaattca aaattttcgg gtttottocA Gggocogcag 

/\ 3'sj. 

4921 oaottcoctt tggoaoggac cogcoaogct cctctggaao gGTgaagggg cogtagtaat 

5'sj /\ 

4981 ocaogataat ogtgocataa oogtagtgcc oagoagoooa gcoaagatco ttogggottA 

sor 23 kD cds start -> 
5041 TGgaaooeog o t ggcogg t g ctgottgtgt ggcaogtogo caggo tQOOO . tTA jg dS_„ 

Crossover <- pol end 

linker Sequence 
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TABLE 3 

HIV-l pol gene 
HIVHXB2 Sequence 

Data from Human Retroviruses and AIDS, 1988 
Los Alamos National Laboratory 

AcNPV-HIVWHpoI Virus 

RBS 

Bain 111 ti (Acnpv-hivwhdoI start) 

5 ' -GGATCCTATAAATATG tttttto gggaagatc t 
pol cds start (NH2-terminus uncertain) -> 



2101 


ggccttccta 


caagggaagg 


ccagggactt 


ttcttcagag 


cagaccogag 


ccaacagccc 


21 SI 


caccagaaga 


gagcttcagg 


tctggggtag 


agacaacaac 


tccccctcag 


aagcaggagc 


2221 


cgatogacaa 


ggaactgtat 


cctttaactt 


ccctcoggtc 


actctttggc 


aacgacccct 


2281 


cgtcacaaTA 


Aagatagggg 


ggcaactoaa 


ggaagctcta 


ttagatacog 


gagcagatga 


AcNPV-KIVYKdoI 


Bam III 


BBS ti 








Virus 


5 ■ - GGATCCTATAAATATG 


(ti;AcNPV 


-HIVYKpol 


start) 


2341 


tacagtottav 


gaagaaatga 


gtttgccagg 


aagatggaaa 


ccaaaaatga 


tagggggaat 


2401 


tggaggtttt 


atcaaagtaa 


gacagtatga 


tcagotactc 


atogooatct 


gtggacataa 


2461 


agctataggt 


acagtattag 


taggacctac 


occtgteaac 


atQOttggaa 


gaoatctgtt 


i 




ggxxgcacxx 


toaattttcc cattogccct 


attgagactg 


taccagtaaa 


25B1 


attQOogcca 


ggaatggatg 


geceaaaagt taaocaatgg 


ccattgacag 


oagaaaaaot 


2641 


aaaagcatta 


gtagaoattt 


gtacagagat 


ggaaaaggaa 


gggaaaattt 


caaaaattgg 


2701 


gcctgaoaat 


ccatacaata 


ctccagtatt 


tgccotaoag 


aaaaoagaca 


gtactaaatg 


2761 


gagaaaatta 


gtagotttca 


gagaacttaQ 


taogagaoct 


caagacttct 


gggaagttca 


2B21 


attaggaata 


ccacatcccg 


cagggttaaa 


aaagaaaaaa 


tcagtaocag 


toctooatot 


2881 


gggtgatgca 


tatttttcog 


ttcccttoga 


tgaogacttc 


aggoagtata 


ctgcatttoc 


2941 


catacctogt 


ataaacaatg 


ogacaccagg 


gattogatat 


cagtacaatg 


tgcttccaca 


3001 


gggatggaaa 


ggatcaccag 


caatattcca 


aqgtogcotg 


ocaoaoatct 


tagagccttt 


3061 


tagaaaacaa 


aatccagaca 


togttotcta 


tcaotocatg 


gatgatttgt 


atgtaggatc 


3121 


tgacttagaa 


atagggcagc 


atogoGcoaa 


oatogaggog 


ctgagacaac 


atctgttgag 


3181 


gtggggactt 


accacoccag 


QCOQOQQOCa 


tcagaaogaa 


cctccattcc 


tttggatggg 


3241 


ttatgooctc 


cotcctgato 


aatggocagt 


ocagcctata 


gtgctgccag 


ooaaagacag 


3301 


ctggactgtc 


ootgacotac 


agaogttogt 


ggggaaattg 


oattgggcao 


gtcagattto 


3361 


cccagggatt 


ooagtoaggc 


oottatgtoo 


actccttaga 


ggaaccaoag 


cactaacoga 


3421 


agtaotacca 


ctaacagaog 


aagcagagct 


qgooctggca 


gaaaocagog 


ogattctaao 
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Table 3 cont'd 



3,81 ogocccogta catggcgtgt cttotgoccc ctcaaaogac ttootogccg oactocogao 
3541 gcoggggcoa ggcccatgga catatcaaat ttotcaagcg ccatttaaoa otctgooaac 
3601 aggaaaata* gcacgaotga ggggtgccco cactaatgat gfooacaat tcocogaggc 
3661 cgtgcaooaa otaaccacag oocgcotogt catatggggo aagactccta aatttaacct 
3721 geccatacaa acg gaQO co t gggaaacatg gtggacagog tattggcoag ccocctggot 
3781 tcctgagtgg gcgtttgtto atocccctcc cttcgtgaaa «o t gg tQCC QgttQgagaa 
38,1 ogoccccoto g tQ g gogcag oaoccttcta tgtagotggg gcogctaaca gggQgoctaQ 
3901 attaggcaaa gcoggototg ttoctactog aggaagacaa aoag^tco cccto^o 
39S1 cocoacaaa* cogcogcctg agttocoogc ootttatcta gctttgcogg ott: C gg gatt 
4021 agcog tooac Qtogtoocag octCQCQOtQ tgcottQggQ atCQ ^ CQQg 
,081 tcaaogtgca tcagogttag tcaotcaaot oa* 0 g 0gcag ttoatQaaQQ aggQQOQg ^ 
41,1 ctotctggca tgggtcccog cocacaacgg aattggagga aatgcacoag toga**-** 
4201 ag t c 0 g t gct ggaatcogga aagtoctott ttt og at g 0 o atog atoagg cccaogQtgQ 
4261 acatgagooa tctcacogta cttggogagc aotggctagt gottttaocc tgccacctgt 
4321 ogtogcaooa gaaotogtog ccogc t3tgQ t oaa t g tC ag Ctaaoaggag aogccQtgcQ 
4381 tggocoagto goctgtcgtc coggootatg gcaactagot tgtocacctt ta g Qa g gaaa 
4441 og ttotcctg g togcogttc Qtgtogccog tggQtQtQtQ gQogcQgoQg 

4501 ogoaccaggg caggaaacog catottttet tttaaaotto gcoggacgat ggccogtaac 
4561 oacoatccot octgccoctg gcogcao tt t caccggtgct acggttaggg ccgcctg ttg 
*S21 Stgggcggga otcaagccgg ootttggoct tccctacaat ccccooogtc aaggogtcgt 
4681 agaatctctg oatooogaa* tooogaoaot t0t0 g 9 acag gtaog Q g Qtc oggctgooco 
4741 tcttoogaca gcogtacaca tggcagtatt catccacaot tttaaacgao a a ggg gggat 
4801 t gggggg tQ c agtgcagggg aoagaatagt agocotoato gcaacogaca tccaccctcc 
4861 ogoottacoo aaacaoatta cooaaottca ooattttcgg gtttattacA ; Ggg QCQ gca g 
4S21 aaattcact* tggoocggoc cagcooogc* cc tCt gg aQO gGTgoogggg 

5 ' S J /\ 
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Table 3 cont'd 



4981 


acaagataat 


ogtgacataa 


aagtagtgcc 


aagaagaaoa 


gcaaagatca 


Bam HI 


5041 


TGgaaaacag 


atggcaggtg 


atgattgtgt 


ggcaagtaga 


caggatgagg 


atTAGGATCC 

■Sph I 
GCATG- 
<- pol end 


5101 


ggaaaagttt 


agtaaaacac 


catatgtatg 


tttcagggaa 


agctagggga 


tggttttatq 


5161 


gacatcacta 


tgaaagccct 


catccaagaa 


taagttcaga 


agtacacatc 


ccactagggg 


5221 


atgctagatt 


ggtaataaca 


acatattggg 


Dtctocatoc 


aggagaaaga 


gactggcatt 


5281 


tgggtcaggg 


agtctccata 


gaatggagga 


aaaagagata 


tagcacacaa 


gtagaccctg 


5341 


aactagcaga 


ccaactaatt 


catctgtatt 


actttgactg 


tttttcAGac tctgctataa 
/\ 3'sj 


5401 


gaaaggcctt attaggacac 


atagttagcc 


ctaggtgtga 


atatcaagca 


ggacataaca 


5461 agGTaggatc tctacaatac ttggcactag 
5'sj /\ 


cagcattaat 


aacaccaaaa 


aagataaagc 


5521 


cacctttgcc tagtgttacg 


aoactgacag aggatagATG 
R orf cds start -> 


gaacaagccc 


cagaagacca 


5581 


agggccacag 


agggagccac 


acaatgaatg 


gacacTAGag cttttagagg agcttaagaa 
<- sor 23 kD cds end 


5641 


tgaagctgtt 


agacattttc 


ctaggatttg 


gctccatggc 


ttagggcaac 


atatctatga 


5701 


aacttatggg 


gatacttggg 


caggagtgga 


ogccataata 


agaattctgc 


aacaactgct 



5761 gtttatccat tttcAGaatt gggtgtcgac aTAGcagaat aggcgttact cgacagagga 

/\ 3'sj <- R orf cds end 

5821 gagcaagaaA TGgagccagt ogatcctaga ctagagccct ggaagcatcc aggaagtcag 
tat cds start -> 

5881 cctaaaactg cttgtaccaa ttgctattgt aaaaagtgtt gctttcattg ccaagtttgt 

5941 ttcataacaa aagccttogg catctcctAT GgcAGgaaga agcggagaca gcgacgaaga 

trs/art cds start -> /\ 3'sj 

6001 gctcatcaga acagtcagac tcatcaagct tctctatcaa agcaGTaagt agtacatgta 

(tat, trs/art, 27 kD) 5'sj /\ 

6061 AcGcaaccta taccaatagt agcaatagta gcattagtag tagcaataat aatagcaata 
U orf -> 

6121 gttgtgtggt ccatagtaat catagaatat aggaaaatat taagacaaag aaaaatagac 
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CLAIMS : 

1. The use of a polypeptide as a reagent in a 
diagnostic test for HIV infection or as a vaccine against 
HIV infection characterized in that said polypeptide 

5 comprises a substantial portion of each of more than one of 
the enzymes coded for by the HIV-pol gene. 

2. The use claimed in claim l characterized in that 
said polypeptide comprises a plurality of enzymes selected 
from HIV-pol protease, HIV-pol reverse transcriptase, 

10 HIV-pol RNAse H and HIV-pol Integrase. 

3- The use claimed in Claim 2 characterized in that 

said polypeptide comprises substantial portions of all four 
of said enzymes. 

4. The use claimed in Claim 2 characterized in that 

15 said polypeptide omits at least that part of the amino acid 
sequence of the HIV-pol protease gene which codes for the 
active site responsible for proteolytic activity. 

5. The use claimed in Claim 3 characterized in that said 
polypeptide omits at least that part of the amino acid 

20 sequence of the HIV-pol protease gene which codes for the 
actxve site responsible for proteolytic activity. 

6. A diagnostic kit for detecting antibodies to HIV 
antigens characterized in that said kit contains as a test 
reagent, a polypeptide as defined in Claim l, claim 2, 

25 Claim 3, Claim 4 or Claim 5. 

7. A vaccine for protecting an individual against HIV 
infection comprising a polypeptide and a pharmaceutical^ 
acceptable carrier, characterized in that said polypeptide 
is as claimed in Claim 1, claim 2, Claim 3, Claim 4 or 

30 Claim 5. 

8 . a polypeptide comprising a substantial portion of 
each of more than one of the enzymes coded for by the 
HIV-pol gene characterised by omitting at least that part of 
the amino acid sequence of the HIV-pol protease gene which 

35 codes for the active site responsible for proteolytic 
activity. 
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9. A polypeptide as claimed in Claim 8 characterized by 

comprising sequences of a plurality of enzymes selected from 
HIV-pol protease, HIV-pol reverse transcriptase, HIV-pol 
RNase H and HIV-pol Integrase. 
5 10. A polypeptide according to Claim 9 characterized 

in that said polypeptide contains substantial portions of 
all four of said enzymes. 

11. A polypeptide according to Claim 8 characterized in 

that said polypeptide has an amino acid sequence 
10 substantially as shown in Table 3 beginning with the amino 
acid Met marked "AcNPV-HIVYKpol starts". 
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